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ABSTRACT 

We present an analytic model for the local bias of dark matter hales in a ACDM universe. The model 
uses the halo mass density instead of the halo number density and is searched for various halo mass 
cuts, smoothing lengths, and redshift epoches. We find that, when the logarithmic density is used, 
the second-order polynomial can fit the numerical relation between the halo mass distribution and the 
underlying matter distribution extremely well. In this model the logarithm of the dark matter density 
is expanded in terms of log halo mass density to the second order. The model remains excellent for all 
halo mass cuts (from Mcut = 3 x 10^"'^ to 3 x lO^^/i~^M0), smoothing scales (from R = 5/i~^Mpc to 
50/i~^Mpc), and redshift ranges (from z = to 1.0) considered in this study. The stochastic term in 
the relation is found not entirely random, but a part of the term can be determined by the magnitude 
of the shear tensor. 

Subject headings: methods: N-body simulations - methods: numerical - galaxies: halos - cosmology: 
theory - dark matter - large-scale structure of Universe 



1. INTRODUCTION 

Understanding the formation and distribution of galax- 
ies requires knowledge on initial conditions, effects of 
gravitational evolution, dark matter halo formation, and 
galaxy formation in dark halos. The standard paradigm 
is to adopt Gaussian initial density fluctuations that grow 
through gravitational instability to form dark matter ha- 
los. Galaxies are thought to form and evolve in dark ha- 
los, which in turn undergo a series of mergers and accre- 
tion (White & Rees 1978). It is therefore expected that 
the observed galaxy distribution is somewhat different 
from the halo distribution and also from the underlying 
matter distribution even though it is popularly assumed 
to trace the dark matter with a constant bias on very 
large scales. In many future surveys of galaxy redshifts 
it is hoped to measure cosmological parameters with a 
high precision from the galaxy distribution, and it is re- 
quired to know the relations among matter density field, 
distribution of dark matter halos and galaxy distribution 
very accurately. 

Cosmological N-body simulation is a useful tool that 
can be used to find the relation between matter and halo 
distributions, which we will focus on in this paper. Man- 
era & Gaztanaga (2011) recently studied a local halo 
bias model using an N-body simulation. They adopted 
a non-linear, local, deterministic bias model of Fry & 
Gaztanaga (1993) and Taylor-expanded the halo num- 
ber density contrast rih as a second-order polynomial of 
the matter fiuctuation 5m- The coefficients of the linear 
and quadratic terms are measured as functions of cubi- 
cal pixel size and halo mass cut. In this paper we extend 
their work to obtain a halo bias model that describes 
the local halo-matter density relation in a more univer- 
sal way. There are many other studies which adopted 
this model ; Guo & Jing (2009) calculated galaxy bias 
up to the second order from bispectrum based on this 
model. Roth & Porciani (2011) applied the Eulerian lo- 



cal bias(ELB) model and compared the results between 
Standard Perturbation Theory (SPT) and simulations. 
It has proved that SPT is a good theoretical model in pre- 
dicting the bias at fairly large smoothing scale. However, 
since the growth of the structure is not deterministic, it 
cannot provide a full picture in estimating bias. Pollack 
et al. (2011) used four different probes to estimate bias: 
smoothed density field, power spectra, bispectra and re- 
duced bispectra. They claimed that the Fourier-space 
analysis is more reliable since it is free from the mode- 
mixing problem of the real-space analysis. I n this study 
we applied smoothing scale dependent bias to get rid of 
this problem. 

We first adopt to use the halo mass density instead of 
the halo number density in the halo bias model. This is 
because the halo mass density has a much tighter and 
simpler relation with the underlying matter density than 
the halo number density (see Fig. 1 in section 3.1 be- 
low). Seljak et al.(2009) reported that weighting central 
halos by their mass can significantly reduce stochasticity 
relative to the dark matter below the Poisson expecta- 
tion. Park et al. (2010) showed that the gravitational 
shear calculated from the halo mass density (calculated 
from the halo distribution by weighting halos with their 
mass) has a much tighter relation with the true shear field 
of dark matter than that from the halo number density 
(halos are uniformly weighted above a certain mass cut). 
Observational application of the halo mass-weighting to 
local density estimation has been already made by Park 
et al. (2008), with an assumption that luminous galaxy 
mass is proportional to halo mass (see also Park & Choi 
2009, Park & Hwang 2009, Hwang & Park 2009 for more 
applications). 

Second we use the logarithmic density ln(l-|-5) instead 
of the density contrast 5 in our halo bias model. It turns 
out that using the logarithmic density allows us to find 
a very simple analytic bias model that are good for all 



2 



Jee, et al. 



smoothing scales, halo mass cuts, and rcdshifts consid- 
ered. There have been many suggestions for using the 
logarithmic density as a model for weakly non-linear den- 
sity field in cosmology (Cole & Jones 1991; Colombi 1994; 
Kayo et al. 2001; Neyrinck et al. 2009). Cole & Jones 
(1991) has proposed to use a lognormal model for the 
matter distribution evolved from Gaussian initial condi- 
tions. They provided astrophysical motivations for con- 
sidering the lognormal model. In particular, they showed 
that the mass flow governed by the continuity equation 
leads the density fleld to a lognormal distribution in the 
non-linear regime if the velocity fluctuation is assumed 
linear. Kayo et al. (2001) showed through a comparison 
with N-body simulations that the lognormal probability 
distribution is a useful empirical model for the cosmolog- 
ical density fluctuations, which is insensitive to the shape 
of the density power spectrum. Neyrinck ct al. (2009) 
found that nonlincaritics in the dark matter power spec- 
trum are almost absent when the density is transformed 
to the log density. They also provided several reasons to 
use the logarithmic density mapping. 

Third, we expand the matter density in terms of the 
halo density, which is opposite to all previous studies. 
This is required by the simulation data. The log matter 
density can be flt excellently by a second-order polyno- 
mial of the log halo mass density, not vice versa. A model 
of the form S„i = f{Sh) is in practice useful when Sh is 
an observed quantity and 6m is the one to be estimated. 

As in Mancra & Gaztanaga (2011) we restrict our 
study of the halo biasing in the real conflguration space. 
The effects of the redshift-space distortion will be of prac- 
tical interest, but an observed distribution of galaxies in 
redshift space can be first corrected for the fingers-of-god 
by shrinking massive clusters and groups of galaxies and 
for the large-scale peculiar velocity by using the second- 
order perturbation theory (Gramann et al. 1994) before 
the halo bias model in real space is applied. We will 
briefly discuss the halo bias in Fourier space in section 5. 

2. N-BODY SIMULATION 

2.1. Simulation 

The simulation used in this paper is based on the 
Wilkinson Microwave Anisotropy Probe (WMAP) three- 
year parameters (Spcrgcl ct al. 2007), Ha = 0.762, 
flm = 0.238, = 0.042, = 0.958, h = 0.732, and 
as = 0.761, where Qa, fi^, and fif, are the density pa- 
rameter associated with the cosmological constant, mat- 
ter, and baryon, respectively, rig is the slope of the pri- 
modial power spectrum, and as is the rms fluctuation of 
the matter density fleld smoothed with a 8 ft,~^Mpc ra- 
dius top-hat sphere. The simulation evolved 2048"^ cold 
dark matter particles with mass 9.6 x 10^h~^MQ, and 
the minimum halo mass was 2.9 x lO^^h^^ Mq (30 parti- 
cles) . Initial conditions were generated on a 2048^ mesh, 
in accordance with the ACDM power spectrum (Eisen- 
stein & Hu 1998). The physical size of the simulation 
cube is 1024/i~^Mpc. The initial epoch of the simulation 
was z — 47, and 1880 global time steps were taken until 
the simulation reaches the present epoch. To increase 
the spatial dynamic range we use a parallel N-body code 
made by Dubinski et al. (2003, 2004) which is a merger 
between a PM code (Park 1990, 1997) and a tree-code 
(Barnes & Hut 1986). 



2.2. Dark Halos 

Since our study compares the halo distribution with 

the distribution of the underlying dark matter, the way 
halos are deflned can be a criicial problem. The most 
widely used method, the fricnds-of-fricnd (FoF) algo- 
rithm, provides an easy way to identify virialized halo 
regions using a particle linking length parameter (Audit 
et al. 1998; Davis et al. 1985). Matching the FoF ha- 
los with the observed galaxies is unrealistic because each 
FoF halo can contain many galaxies. A better model is 
to use subhalos each of which is assumed to contain one 
galaxy. In this model galaxy mass or luminosity is as- 
signed to halos with the constraint that the halos with 
mass above a certain limit have the number density equal 
to that of the galaxies with stellar mass or luminosity 
above a certain limit. This one-to-one correspondence 
model or abundance matching method of galaxy assign- 
ment scheme has been shown to be quite successful in 
describing the observed galaxy distribution (Marinoni & 
Hudson 2002; Vale & Ostriker 2004, 2006; Shankar et 
al. 2006; Kim et al. 2008; Gott et al. 2009; Choi et al. 
2010). 

Below we will show that the matter density fleld has a 
much higher correlation with the halo mass density fleld 
than the halo number density field. If the subhalo mass 
is weighted to each halo, the mass density fields of the 
FoF halos and of the subhalos will be the same and it is 
not important which halo identification scheme is used. 

To identify subhalos, the physically self-bound (PSB) 
group- finding algorithm developed by Kim & Park (2006) 
and Kim et al. (2008) is used. The PSB method first 
finds local particle groups using the FoF algorithm and 
convert the particle distribution in each group into a den- 
sity field measured at particle positions using a variable- 
size Spline kernel. The particles in each group are as- 
signed into subgroups near the density peaks. The sub- 
group particles are used to redefine the subgroup member 
particles to identify the tidally stable and gravitationally 
self-bound subhalos through iteration. The PSB group- 
finding method has its advantage in resolving halos even 
in dense environment, and can resolve subhalos down to 
the gravitational force resolution. It should be noted 
that subhalos are not substructures of the host halo, but 
immigrated objects that are not yet disrupted within the 
host and thus should be resolved and treated separately 
when individual self-gravitating non-linear objects are to 
be identified. After flnding the FoF halos using a stan- 
dard linking length, the PSB algorithm is applied to iden- 
tify subhalos. In each FoF halo the most massive subhalo 
is named the central or main subhalo, and the rest are 
called subhalos. In this study, we hereafter use the term 
'halo' for the both kinds of subhalos. 

3. MATTER DENSITY-HALO MASS DENSITY 
RELATION 

3.1. Halo mass density field 

When the spatial clustering of galaxies is studied, tra- 
ditionally galaxies are uniformly weighted or weighted 
only according to the selection function. The resulting 
number density fleld is used to calculate clustering ampli- 
tude statistics such as the two-point correlation function 
and the power spectrum (Davies & Peebles 1983; Park et 
al. 1994). The uniform weighting is still widely used to 
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Fig. 1. — The relation of the matter density fluctuation <5„i with 
the halo number density fluctuation (upper panel) and halo mass 
density fluctuation (lower panel). The relations are obtained on a 
Gaussian smoothing scale of 5/i~^Mpc. The solid line in the lower 
panel is a second-order polynomial flt to the data. 

study the non-hnearity, scale-dependence, and bias in the 
distribution of galaxies (Tegmark et al. 2006). The main 
reason for using the uniform weighting is to reduce the 
shot noise. If galaxy luminosity is used as weight, for ex- 
ample, the resulting luminosity density field is dominated 
by the rare brightest objects. However, it has been re- 
cently shown that the halo mass density field has a much 
tighter correlation with the underlying matter field than 
the halo number density field (Park et al. 2010), and the 
uniform weighting cannot be justified because the bias 
function of the galaxy number density field is not only 
more complicated but also less correlated with the mat- 
ter density field compared to the galaxy mass density 
field as we show in Figure I. 

In Figure 1 we demonstrate that the halo mass density 
field has a tighter relation with the matter density field 
than the halo number density. The y-axis in the upper 
panel of Figure I is the overdensity in the halo number 
density, and that in the lower panel is the overdensity in 
the halo mass density. The density fields are smoothed 
with a Gaussian filter with radius of i? = 5ft.^^Mpc. It 
can be seen that the halo mass density has a relation with 
the underlying matter density not only much tighter but 
also much simpler than the halo number density. The 
disadvantage of using the number density is even more 
serious if the FoF halos are used because the number 
density of the FoF halos is relatively low in high density 
regions. 

3.2. The second-order bias model 

Even though the relation with the matter density be- 
comes much tighter and simpler when the halo mass den- 
sity is used, it can not be fit by a low-order polynomial. 
The line in the lower panel of Figure 1 is a second-order 
polynomial, (5/i = 6o + bi5m + ^2^m, best fit to the nu- 
merical relation. It can be seen that the formula can not 
fit the data at low densities. We also found that, even if 



3 

higher-order terms are added, fitting by a polynomial is 
not always successful at all redshifts. This reveals a sig- 
nificant limitation of the bias model of Fry & Gaztanaga 
(1993) working only in a very weak non-linear regime. 

We find that, when the logarithmic density is used, a 
second-order polynomial relation between the halo mass 
density and the matter density stands almost universally. 
We first smooth the mass- weighted halo distribution and 
the matter density field with a Gaussian filter of radius 
R and take the logarithm of the resulting smooth den- 
sity fields D — ln(I -|- 6). The logarithmic transforma- 
tion can make the result diverge in void regions. Since 
the smoothing length is always chosen to be equal to 
or greater than the mean halo separation, the Gaussian 
smoothing avoid this situation in practice. Note that, 
when I (5 1 <^ I, D approaches 5 and our model reduces 
to the conventional linear bias model. The scatter plots 
in Figure 2 are the relations between Dh = ln(l -I- 5h) 
(x-axis) and Dm = \n{l + 5m) (y-axis) at redshifts z — Q 
(left column), 0.5 (middle column), and I.O (right col- 
umn). The smoothing scales are i? = 5 (top row), 15 
(middle row), and 50 ft.~^Mpc (bottom row). Our model 
for these numerical relations is 

Dm^Po+PiDh + li2Dl, (1) 

where jSi 's are functions of the smoothing scale R and the 
halo mass cut Mcut- It should be noted that the matter 
density is expanded in terms of the halo density. The 
solid lines in Figure 2 are second-order polynomials best 
fit to the scatter plots for the halos with mass more than 
3 X IO"/i"^M0. For higher mass cuts of M^t = 1 x 10^^ 
(long-dashed lines) and 3 x IQ^'^H'^Mq (dotted lines) 
we show only the best fits without scatter plots to avoid 
confusion. Since the density fields should be estimated 
from discrete points like halos or galaxies, the shot noise 
can be an issue. We choose the smoothing length R equal 
to or greater than the mean halo separation to reduce 
the shot noise. As a result for the halo sample with 
M^^t = 3 X fO"/i-iM0(d = 4.62/i-iMpc) we study the 
halo bias on the scales R > 4.62/i~^Mpc. Likewise for 
a sample with Mcut = f or 3 x lO^^/i^^M0((i = 6.56 or 
9.56/i~^Mpc) the halo bias is studied on scales R > 6.56 
or 9.56/i^^Mpc, respectively (see Weinberg et al. 1987 
and Park et al. 2005 for choice of the smoothing length 
for the topology study of galaxy distribution) . 

3.3. Dependence on smoothing scale and halo mass cut 

The coefficients /3i's in Equation (1) depend on 
smoothing length and halo mass cut. Figure 3 shows the 
dependence of (3i on R and Mcut at three redshifts. In 
this paper, all the smoothings are done using the Gaus- 
sian filter, as stated in section 3.2. The zero-point offset 
(3o of the halo mass density-matter density relation is 
significant on small scales, but rapidly vanishes as R in- 
creases. The zero-point off-set is also seen in the Sm-Sh 
relation. On the other hand, the coefficient of the second 
order term does not decrease much as R increases, and 
Equation (1) remain quadratic even on very large scales. 
Equation (1) can be approximated by a linear bias model 
Sfi = bi6m on large scales not because the equation be- 
comes linear but just because the ranges in Dh and Dm 
become small. This can be seen in Figure 2, where the 
shape of the fitting curves hardly changes as R increases. 
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Fig. 2. — The relations between the logarithmic halo mass density and the underlying logarithmic matter density for different mass cuts 
and smoothing scales at different redshifts. The panels in the first, second and third row are for Gaussian smoothing length of 5, 15 and 
50 h~^Mpc, respectively. Those in the first, second and the third column are the relations at z=0, 0.5 and 1, respectively. Also shown are 
the relations for the cases of three halo mass cuts of 0.3 (solid line), 1.0 (long-dashed line), and 3.0 (dotted line) in units of 10^^h~^ Mq 
best fit to the data. 



We fit the values of /3i's at z = over the R ranges 
from 5 to 50/i~^Mpc and the Mcut range from 0.3 x 10^^ 
to 3.0 X 10^^/i~^AfQ using a routine obtained from 
http: / / www.zunzun.com, The functional forms of the 
coefficients we use are 



f3o = - 



a + bMr, 



cR + dMcutR 



1 



/Afcut 

a - 



-gR + 
bMcut 



+ cR + dMcutR 



(2) 
(3) 

1 + / In(Meut) + g HR) + h In(Meut) ln(i?) ^ 
Table 1 lists the best-fit values of the constants in these 
functions when the halo mass is in units of 10^'^h~^MQ. 
The parameter values in this table are useful only for 
the cosmological model we adopted. A simulation us- 
ing slightly different cosmological parameters would give 
slightly different /3's. The aim of the current study is to 



TABLE 1 

Coefficients of the second-order bias model at redshift 
z = 0. 
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c 
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9 


h 


i 




-0.297 


1.85 


1.77 


0.87 










132 


0.056 


7.15E-3 


-6.66E-4 


-7.32E-5 


0.175 


-9.39E-3 


1.08E-4 




/3o 


1220 


11.5 


14.5 


-0.315 


-1310 


4490 


354 


-0.105 



show that the mapping from halo mass density to under- 
lying matter density can be well-modeled by a second- 
order polynomial when logarithmic densities are used. 

3.4. Dependence on redshift 

We find that the redshift dependence of the Dh-Dm 
relation can be accurately modeled through rms value 
matching of the D fields across different redshifts. When 
a second-order Dh-D„i relation is found at one epoch 
zi, the corresponding equation at a different epoch Z2 
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Fig. 3. — The bias factors /3i's in our second-order halo bias model of the logarithmic density. For each halo masscut of 0.3 X lO^^h ^M© 
(solid line), 1 X 10^^ Mq (dashed line) and 3 X lO^^h~^M0 (dotted line) the bias factors are given as a function of the Gaussian 
smoothing radius. The bias factors are inspected at three redshift epoches 2 = 0, 0.5, and 1.0. 



is accurately given if D's are scaled to D(Jd{z2)/ (Td{zi)- 
This is demonstrated in Figure 4. It shows the Dh-Dm 
relations for Mcut = 3 x lO"/i~^M0 and R = 5/i^^Mpc 
at redshifts z = (dotted line), 0.5 (short-dashed line), 
and 1.0 (long-dashed line). The lines for the z = 0.5 
and 1 cases are drawn by scaling the x and y-axes by 
aD[z)/aD{^)- It can be seen that the lines are very close 
to one another once the scaling is made. To make Equa- 
tion (1) applicable at different redshifts we introduce a 
scaled variable A — D(7d{z)/(7d{0) and arrive at a mod- 
ified halo bias model 

A™ = /?o+/3iA;,+/32A2, (5) 

where A™ = DmCrD„(2:)/(TD„(0) and Ah = 
DhO'Dh{z) / cFDhi^)- We emphasize again that the mat- 
ter density is expanded in terms of the halo mass density 
in our model. A's are equal to D's at z = 0. It is re- 
assuring that the redshift dependence of versus 
relation satisfies this scaling relation down to the smooth- 
ing scales as small as 5h~^Mpc. This scaling property 



of our second-order polynomial bias model also makes 
the model applicable to cases with a different degree of 
biasing. 

4. STOCHASTICITY 

At a particular point in space the halo density is not 
completely fixed for a given matter density, but can have 
a range of values as can be seen in Figure 2. This dis- 
persion in the relation is called stochasticity (Seljak et 
al. 2009). Ue-Li Pen(1998) claimed that on large scales 
galaxy density variance, galaxy-dark matter density bias, 
and their cross-correlation coefficient are the only coeffi- 
cients that are required in determining the dark matter 
power spectrum, and in mildly nonlinear regime skew- 
ness and non-linear bias should come into consideration. 
Dekel & Lahav(1999) proposed conditional distribution 
in estimating random fluctuations of galaxy and mass 
density. They claimed that the scatter in this rela- 
tion arises from the hidden factors related to shot noise. 
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Fig. 4. — The relation between D/j = ln(H-(5fi) and Dm = ln(l+ 
5m) at redshift z =0 (dotted line), 0.5 (short-dashed) and 1.0 (long- 
dashed). The lines for z = 0.5 and 1 are scaled by Dcr{z)/(7{0). 



galaxy formation etc. Somerville et al. (2001) modeled 
the non-linear stochastic bias as a combination of mean 
biasing function and scatter about it. It was claimed 
that it depends on halo mass, luminosity and scale. They 
investigated the time dependence of bias in detail, and 
concluded that the evolution of biasing depends largely 
on the cosmological model due to the growth rate. Sel- 
jak & Warren (2004) estimated the halo bias as approx- 
imately constant for halo masses one-tenth below the 
non-linear mass. They also explored the growth of in- 
dividual Fourier modes of dark matter density field and 
found large fiuctuations due to non-linearity. Similar 
large fiuctuations were found between modes of halos 
and dark matter densities, and between modes of halo 
density fields with different halo mass. In a succeeding 
study, Bonoli & Pen (2009) tested the validity of basic 
assumption of halo bias studies that presumes no intrin- 
sic stochasticity between halo and dark matter. From 
the behavior of stochasticity in the correlation between 
halos with different mass and between halos and dark 
matter as a function of mass, they claimed that even on 
large scales, stochasticity cannot be neglected. Roth & 
Porciani (2011) tested the third-order standard pertur- 
bation theory (SPT) by comparing the dark matter field 
produced by simulation with the numerically-calculated 
dark matter field using the SPT from initial conditions, 
both point-by-point and statistically. They claimed that 
on large scales above 8h~^Mpc, they agreed well up to 
redshift 0. To relate the matter field to halo density fields 
identified from the simulation, they applied Eulerian bias 
(ELB) model up to the third order by fitting scatter 
plots. Using this bias they reconstructed the halo field 
to show that it cannot reproduce all the detailed prop- 
erties of halo distribution. On large scales they showed 
that bias can be approximated as a constant and it agrees 
with the parameter bi from the local bias model, in those 
regions with S << 1. As a consequence they concluded 
that the SPT is a good method to study the matter field 
on fairly large smoothing scales, but the ELB model can- 
not fully reconstruct the halo density field. 



Since all the matter in ACDM universes is contained 
in halos. Ah is identical to and the A^-A^ relation 
will be exactly linear when the halo mass cut is zero. 
Since all the matter in the LCDM universes is contained 
in halos in the case of a pure N-body simulation, then 
the stochastic term is thought to be caused by the back- 
ground uncounted halos with mass below the mass cut. 
(In practice, the unresolved halos below the mass cut is 
responsible only for a part of stochasticity since there can 
be other forms of mass components whose distribution 
can be governed by various environmental factors.) To 
reduce the stochasticity Hamaus et al. (2010) suggested 
a simple halo weighting scheme where the weight to a 
halo with mass AI^ is not Mh but Mh + Mcut ■ When we 
implement this weighting scheme into the density calcu- 
lation, we find the 5h-Sm relation slightly changes and its 
dispersion somewhat decreases. We find the reduction of 
the scatter is about 10% in the case of Mcut = 3 x lO^^M© 
and R = 5/i~^Mpc. Since the reduction of stochasticity 
is not significant, we will not adopt this weighting. 

We look for a possibility that some local parameters 
other than the local density have information on the un- 
derlying matter density, and a part of the stochasticity in 
the A/i-Am relation is actually not random but can be 
determined by such environmental parameters. We in- 
vestigate the dependence of the scatter on a few local and 
nonlocal physical parameters. They include the gravita- 
tional shear tensor didj^, Laplacian of the logarithmic 
halo density field V^A, and the difference between the 
densities on two smoothing scales A(i?i) — A(i?2)- The 
mass cut is fixed to A/cut = 3 x 10^^h~^ Mq in this sec- 
tion. 

The halo collapse time and the mass accreted till an 
epoch can depend on various environmental parameters, 
and the halos with mass above Mcut located in the same 
local density regions can have different physical param- 
eters and the shape of the mass function can depend on 
environment. For example, the gravitational shear force 
can infiuence the halo formation under the same density 
environment (McDonald & Roy 2009); see also Chan et 
al. (2012) and Tobias et al. (2012). We calculate the 
traceless Hessian matrix of the gravitational potential 



H, 



dxidxj 3 



from the smooth halo mass density and matter density 
fields. The traceless Hessian matrix is used to retain only 
the anisotropic component of the gravitational potential 
field. Its eigenvalues Xi are used to define the ellipticity 
and prolateness parameters 

e=(Ai-A3)/A, (7) 
p=(Ai -2A2 + A3)/A, (8) 
where Ai > A2 > A3 is assumed and the magnitude of the 
shear tensor A = \/Xl+ X2 + A§. The curvature of the 
gravitational potential field determined the shear tensor, 
which produces the tidal torque on density fluctuations. 
The ellipticity and prolateness quantifies the shape of the 
potential by comparing the size of the curvature in di- 
rections of the principal axes, while the shear magnitude 
gives the size of the curvature itself. The ellipticity and 
prolateness parameters are divided by the shear magni- 
tude so that only the information on the potential shape 
can be obtained. 
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Fig. 5. — The scatter term of ln(l + 5^) from the second-order 
halo bias model as a function of ellipticity, prolateness and shear 
magnitude calculated from the halo distribution. 
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Fig. 6. — The scatter term from the second-order halo bias 
model versus the difference between on two smoothing scales 
(left panel) and the Laplaeian of (right panel). 

Figure 5 is the scatter term in ln(l + 6m) from the 
fitting formula shown in Figure 2 (the panel at the upper 
left corner). It can be seen that the scatter term does 
not depend on the normalized ellipticity or prolateness 
parameters, but weakly depends on the shear magnitude. 
A least-square fit of a line results in 

ln(l + <5™)-ln(l + <5„)fit = 0.035 + 0.0321ogio(A2), (9) 

where the error in the slope is 0.01. 

In the left panel of Figure 6 we show the scatter term 
as a function of the density difference between ln(l -I- Sh) 
with R = 5/i~^Mpc and 6.3/i~^Mpc, a measure of non- 
locality. The smoothing volume in the latter case is twice 
that of the first case. We do not find a statistically signif- 
icant dependence of the scatter on the density difference. 
The right panel shows the scatter term as a function of 
the Laplaeian of the smooth halo mass density. Again 
there is no significant correlation. 

Our results can be compared with the theoretical pre- 
diction of McDonald & Roy (2009) who showed that the 
galaxy density can be Taylor-expanded in terms only of 
6m: ^m: ^P to the second order. To conclude we 

find that the scatter term in ln(l -I- Sm) shown in Fig- 
ure 2 can be partly determined by the magnitude of the 
gravitational shear tensor, and the halo bias model can 
be improved to the following formula 

Am^P0+l3lAh+P2^l+d{X), (10) 

where the extra deterministic term d{X) is given by Equa- 
tion (9) for the case of Mcut = 3 x IG^/i-^Mq and 
R — 5/i^^Mpc. For an observed distribution of dark 
halos Equation (10) gives the estimate of the underlying 
matter density field. 

5. SUMMARY AND DISCUSSION 



We present an analytic model for the local bias of dark 
matter halos simulated in an N-body simulation of the 
ACDM universe. We found that a second-order polyno- 
mial model for the relation between the halo mass dis- 
tribution and the underlying matter distribution is an 
excellent fit to N-body simulation data when the loga- 
rithmic density is used. The model is second-order not 
in the matter density, but in the halo mass density. The 
model remains excellent for all smoothing scales (from 
R = 5h~^Mpc to 50/i~^Mpc), halo mass cuts (from 
Mcut = 3 X 10" to 3 X lO^'^h-'^MQ), and redshift ranges 
(from z = to 1.0) considered. The scatter term in the 
relation between the halo mass density and matter den- 
sity is found not entirely random. We showed that a 
fraction of the scatter can be determined by the magni- 
tude of the shear tensor and the scatter can be reduced. 

Cen & Ostriker (1993) claimed that the following 
second-order polynomial relation between matter and 
galaxies 

log{^)=A + B\og{^) + C[\og{^)r (11) 

("s) {Ptot} {Ptot) 

was an excellent fit to their simulation results of Cen & 
Ostriker (1992). Here Ug is the simulated galaxy number 
density and ptot is the total mass density. It should be 
noted that Equation (11) is completely different from 
our model even though it also adopted a second-order 
polynomial of logarithmic density. First, it uses galaxy 
number density, and it is expected from Figure 1 that the 
galaxy number density is not related with the total mass 
density through a simple polynomial. Second, Equation 
(11) is second-order in log total matter density but our 
model (Eq. 1) is second-order in the halo mass density 
and linear in the matter density. We checked Figure 4 
of Cen & Ostriker (1992) and found the figure in actual 
fact supports our model instead of Equation (11). 

Manera & Gaztanaga (2011) studied a local bias model 
of the halo distribution in N-body simulations. However, 
they also used the halo number density instead of the 
mass density, and adopted a polynomial model where the 
halo overdensity is expanded in terms of the matter over- 
density up to the second order. The mean separation of 
the FoF halos they used was ll/i^^Mpc even in the lowest 
mass halo sample at 2; = 0, and their study was reliable 
only on large smoothing scales of i? > ll/i~^Mpc (or 
cubical cell size > 28/i~^Mpc). Therefore the data was 
not quite appropriate for the halo bias study in the non- 
linear regime. Furthermore, the cubical cell smoothing 
kernel they used produced a large shot noise in the halo 
density, which coupled with the true smoothing length 
dependence of the bias factors. 

Recently, Neyrinck et al. (2009, 2011) found that the 
linear regime in the dark matter power spectrum can be 
dramatically extended to high fc's if a logarithmic map- 
ping or a Gaussianization for the density field is made. 
This is a very useful finding because the size of obser- 
vational data used for primordial parameter estimation 
effectively becomes many orders of magnitude larger by 
simply making such a trivial non-linear transformation. 
We do not inspect the relation between the matter and 
halo densities in Fourier space in detail in this paper, 
but just demonstrate that the linear regime extends to 
much smaller scales if the logarithmic transformation is 
made. Neyrinck et al. (2009) showed the same for the 
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Fig. 7. — The upper pannel is the ratio of the power spectrum 
calculated from the 5 fields, and the lower pannel is the same plot 
but calculated from the ln(l+(5)field, in Fourier space. Red dots 
indicate the ratio between the halo power spectrum and matter 
power spectrum. Blue dots are the ratio between the matter and 
initial (Gaussian random field) power spectrum to show how the 
evolved matter field is deviated from the initial power spectrum. 
The black solid lines are the ratios between the halo and matter 
power spectra that show the scale-dependent halo biasing in Fourier 
space. The horizontal lines are constant bias relations. 



are much sparser and their spatial distributions should 
be smoothed to give smooth density fields. To obtain 
a reliable density estimation we adopt to use a Gaus- 
sian smoothing length equal to or greater than the mean 
halo separation. Figure 7 shows the behavior of the halo 
bias in terms of the power spectrum of the smoothed 
halo mass density fields. The black solid line in the up- 
per panel indicates that the halo mass power spectrum 
(-Phaio = (I'^ID) deviates from the linear relation with 
the dark matter power spectrum Pmattor starting from 
k w 0.05/iMpc^^. However, this scale becomes about 
k « 0.2ft,Mpc~^ as shown in the lower panel when loga- 
rithmic values are used. The scale of the nonlinear halo 
bias is effectively reduced by a factor of about 4. We 
have calculated this halo bias function using another sim- 
ulation with the WMAP 5-year cosmological parameters 
and different random initial conditions, and found this 
conclusion remains valid. It is expected that the linear- 
bias regime can be extended to even smaller scales if one 
can apply a smaller smoothing for a denser sample. 

Gaussianization has a similar effect on the density field 
(Neyrinck et al. 2009, 2011). It should be noted that 
Gaussianization has long been used for the study of the 
large-scale topology of galaxy distribution in order to 
examine the primordial non-Gaussianity (Weinberg et al. 
1987; Choi et al. 2010). 



dark matter field, but we show that the same is true even 
if the halo mass density field is used. 

Each N-body simulation gives the dark matter den- 
sity down to very small scales, and the log density can 
be calculated. But in practice, galaxies and dark halos 
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